An Analysis of Quefrency Selective Temporal Smoothing of the Cepstrum in Speech Enhancement

نویسندگان

  • Dirk Mauler
  • Timo Gerkmann
  • Rainer Martin
چکیده

Smoothing selected cepstral coefficients over time has been recently shown to be a powerful method in the enhancement of noisy speech signals. A difficulty that arises in this context is that averaging a random variable in the log-domain changes its mean in the linear domain. The knowledge of this bias is indispensable for most temporal cepstrum smoothing applications. To date, the bias is known only for Gaussian distributed signals and infinite averaging length. This contribution presents an analytic solution for signals that are recursively smoothed in the cepstral domain with arbitrary effective averaging lengths. Additionally, the correct bias is computed also for a quefrency dependent setting of the recursive averaging parameters.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Asr in Reverberant Environments Using Temporal Cepstrum Smoothing for Speech Enhancement and an Amplitude Modulation Filterbank for Feature Extraction

This paper presents techniques aiming at improving automatic speech recognition (ASR) in single channel scenarios in the context of the REVERB (REverberant Voice Enhancement and Recognition Benchmark) challenge. System improvements range from speech enhancement over robust feature extraction to model adaptation and word-based integration of multiple classifiers. The selective temporal cepstrum ...

متن کامل

Speaker identification using cepstrum in Kannada language

Speaker identification in forensic cases involves careful estimation of the features that are more specific to speaker. The anatomical differences in vocal tract depict speaker related differences. Cepstrum has been investigated as a possible parameter for speaker identification. Fundamental frequency obtained from cepstral coefficients indirectly depicts the shape and size of vocal tract. In t...

متن کامل

Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech

This paper presents a system aiming at joint dereverberation and noise reduction by applying a combination of a beamformer with a single-channel spectral enhancement scheme. First, a minimum variance distortionless response beamformer with an online estimated noise coherence matrix is used to suppress noise and reverberation. The output of this beamformer is then processed by a single-channel s...

متن کامل

Joint Dereverberation and Noise Reduction Using Beamforming and a Single-channel Speech Enhancement Scheme

The REVERB challenge provides a common framework for the evaluation of speech enhancement algorithms in the presence of both reverberation and noise. This contribution proposes a system consisting of a commonly used combination of a beamformer with a single-channel speech enhancement scheme aiming at joint dereverberation and noise reduction. First, a minimum variance distortionless response be...

متن کامل

Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis

This paper presents an approach for complex cepstrum analysis based on the minimum mean squared error criterion, and describes its application to statistical parametric speech synthesis. The proposed method alleviates some of the issues associated with conventional complex cepstrum analysis, such as choice of the window, phase unwrapping, and the need for accurate pitch marks. Given initial est...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008